Corpus-based empirical analysis of form, function and frequency of characters used in Bangla

Authors

  • Niladri Sekhar Dash
  • Bidyut Baran Chaudhuri
Abstract

This paper attempts to understand the formal and functional aspects of Bangla characters as used in written texts compiled in a sample monitor corpus, designed systematically from language data collected from various text documents published between 1980 and 1995. The purpose of the study is to understand the form and function of the characters, trace their behavioural peculiarities, and, if possible, find the reasons for such peculiarities. The study focuses on the formation of the characters, their structural change in compound and cluster formation, their contextual use, statistical analysis of their occurrence, and their position in words. The study also covers the use of different punctuation marks in the texts. Finally, some possible areas of application of such analysis are identified.
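
As a rough illustration of the occurrence and word-position statistics mentioned in the abstract, the following minimal Python sketch counts Bangla characters and records whether each falls at the start, middle, or end of a word. It is not the authors' method: the function names `is_bangla` and `char_stats` are hypothetical, the sketch works on individual Unicode code points in the Bengali block (U+0980 to U+09FF) rather than on compounds or clusters, and it assumes words are separated by whitespace.

```python
from collections import Counter


def is_bangla(ch: str) -> bool:
    # Assumption: "Bangla character" means a code point in the
    # Unicode Bengali block, U+0980 to U+09FF.
    return "\u0980" <= ch <= "\u09ff"


def char_stats(text: str):
    """Return (frequency, position) counters for Bangla code points.

    position maps "initial", "medial" and "final" to a Counter each.
    A one-character word is counted as "initial" only.
    """
    freq = Counter()
    position = {"initial": Counter(), "medial": Counter(), "final": Counter()}
    for word in text.split():
        # Drop punctuation, digits from other scripts, and so on.
        chars = [c for c in word if is_bangla(c)]
        for i, ch in enumerate(chars):
            freq[ch] += 1
            if i == 0:
                position["initial"][ch] += 1
            elif i == len(chars) - 1:
                position["final"][ch] += 1
            else:
                position["medial"][ch] += 1
    return freq, position


if __name__ == "__main__":
    # Two-word sample, written with escapes so the code points are explicit.
    sample = "\u09ac\u09be\u0982\u09b2\u09be \u09ad\u09be\u09b7\u09be"
    freq, position = char_stats(sample)
    for ch, n in freq.most_common():
        print(f"U+{ord(ch):04X}", n)
```

Counting code points keeps the sketch short, but a study of compound and cluster formation would need to group code points into larger orthographic units first.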

Similar resources

Genetic Diversity and Nutritional Components Evaluation of Bangladeshi Germplasms of Kidney Bean (Phaseolus vulgaris L.)

Considering the crucial focus on plant developments as high yielding, protein, and disease-resistant varieties, in this study, the genetic diversity and nutritional traits of available kidney bean germplasms found in Bangladesh have been evaluated based on seventeen quantitative and six nutritional traits. Analysis of genotypic, phenotypic variance and covariance showed that higher environmenta...

Selection of Dwarf Stature Yield Potential Lines from F3 Populations of White Maize (Zea mays L.)

A 'dwarf stature' maize variety holds promise for withstanding the unfavorable growth environments of the Kharif season. To develop such a variety, however, dwarf stature inbred lines must be available. Here, twenty-four F3 populations of white maize were evaluated through assessment of their genetic variability, heritability, and character association for selection of dwarf stature promising lines based on y...

Formant Analysis of Bangla Vowel for Automatic Speech Recognition

Regional and local language recognition now draws researchers' attention as a way to bring new technological benefits to the general public. As with other languages, a Bangla speech recognition scheme is in demand. A formant is the resonance frequency of the vocal tract. Formant frequencies play an important role in automatic speech recognition, due to its noise...

The Vocabulary Profile of Iranian English Teaching School books

This paper provides a fairly detailed corpus-based vocabulary profile of the Iranian EFL books used in public schools. To this end, the WordPerfect files of all the seven books were converted to text format to get rid of the formatting features and be compatible with the software used for analysis. The software tools used were the Compleat Lexical Tutor suite, version 6.2 (Cobb, 2011), AntConc ...

Developing a Corpus-Based Word List in Pharmacy Research Articles: A Focus on Academic Culture

The present corpus-based lexical study reports the development of a Pharmacy Academic Word List (PAWL), a list of the most frequent words from a corpus of 3,458,445 tokens made up of the 800 most recent pharmacy texts, including research articles, review articles, and short communications in four sub-disciplines of pharmacy. WordSmith (Scott, 2017) and AntWordProfiler (Anthony, 2014) were used to sc...

Publication date: 2001